ALIZE/spkdet: a state-of-the-art open source software for speaker recognition

نویسندگان

  • Jean-François Bonastre
  • Nicolas Scheffer
  • Driss Matrouf
  • Corinne Fredouille
  • Anthony Larcher
  • Alexandre Preti
  • Gilles Pouchoulin
  • Nicholas W. D. Evans
  • Benoit G. B. Fauve
  • John S. D. Mason
چکیده

This paper presents the ALIZE/SpkDet open source software packages for text independent speaker recognition. This software is based on the well-known UBM/GMM approach. It includes also the latest speaker recognition developments such as Latent Factor Analysis (LFA) and unsupervised adaptation. Discriminant classifiers such as SVM supervectors are also provided, linked with the Nuisance Attribute Projection (NAP). The software performance is demonstrated within the framework of the NIST’06 SRE evaluation campaign. Several other applications like speaker diarization, embedded speaker recognition, password dependent speaker recognition and pathological voice assessment are also presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition

ALIZE is an open-source platform for speaker recognition. The ALIZE library implements a low-level statistical engine based on the well-known Gaussian mixture modelling. The toolkit includes a set of high level tools dedicated to speaker recognition based on the latest developments in speaker recognition such as Joint Factor Analysis, Support Vector Machine, i-vector modelling and Probabilistic...

متن کامل

Application of automatic speaker recognition techniques to pathological voice assessment (dysphonia)

This paper investigates the adaptation of Automatic Speaker Recognition (ASR) techniques to the pathological voice assessment (dysphonic voices). The aim of this study is to provide a novel method, suitable for keeping track of the evolution of the patient’s pathology: easy-to-use, fast, non-invasive for the patient, and affordable for the clinicians. This method will be complementary to the ex...

متن کامل

Inter and Intra-speaker Variability in French: An Analysis of Oral Vowels and Its Implication for Automatic Speaker Verification

Intra and inter-speaker variability is studied as a way to better understand how voice can be used as biometric data. Formant values from 328,016 exemplars of the 10 French oral vowels uttered by 111 speakers were compared to estimate their speaker discrimination power. The vowels /œ/, /ɛ/ and /a/ appear to convey more idiosyncratic information than other oral vowels. A more comprehensive phone...

متن کامل

The RWTH aachen university open source speech recognition system

We announce the public availability of the RWTH Aachen University speech recognition toolkit. The toolkit includes state of the art speech recognition technology for acoustic model training and decoding. Speaker adaptation, speaker adaptive training, unsupervised training, a finite state automata library, and an efficient tree search decoder are notable components. Comprehensive documentation, ...

متن کامل

An Attack on a Text-independent Speaker Authentication System

We mount an effective attack on a third-party open-source text-independent speaker verification system. Specifically, we show how an attacker can simply use a signal generated at fixed frequency to pass speaker verification and gain access to other user accounts. We demonstrate this attack on the GMM-UBM based ALIZE speaker verification system using the YOHO voice database. We show through expe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008